Safety-critical Policy Iteration Algorithm for Control under Model Uncertainty
نویسندگان
چکیده
Safety is an important aim in designing safe-critical systems. To design such systems, many policy iterative algorithms are introduced to find safe optimal controllers. Due the fact that most practical finding accurate information from system rather impossible, a new online training method presented this paper perform reinforcement learning based algorithm using real data instead of identifying dynamics. Also, impact model uncertainty examined on control Lyapunov functions (CLF) and barrier (CBF) dynamic limitations. The Sum Square program used iteratively solution. simulation results which applied quarter car show efficiency proposed fields optimality robustness.
منابع مشابه
Learning control under uncertainty: A probabilistic Value-Iteration approach
In this paper, we introduce a probabilistic version of the wellstudied Value-Iteration approach, i.e. Probabilistic Value-Iteration (PVI). The PVI approach can handle continuous states and actions in an episodic Reinforcement Learning (RL) setting, while using Gaussian Processes to model the state uncertainties. We further show, how the approach can be efficiently realized making it suitable fo...
متن کاملPolicy Iteration Algorithm for Shortest Path Problems
Abstract. The shortest paths tree problem consists in finding a spanning tree rooted at a given node, in a directed weighted graph, such that for each node i , the path of the tree which goes from i to the root has minimal weight. We propose an algorithm which is a deterministic version of Howard’s policy iteration scheme. We show that policy iteration is faster than the Bellman (or value itera...
متن کاملApproximate Policy Iteration for Markov Control Revisited
Q-Learning is based on value iteration and remains the most popular choice for solving Markov Decision Problems (MDPs) via reinforcement learning (RL), where the goal is to bypass the transition probabilities of the MDP. Approximate policy iteration (API) is another RL technique, not as widely used as Q-Learning, based on modified policy iteration. In this paper, we present and analyze an API a...
متن کاملPolicy iteration based feedback control
It is well known that stochastic control systems can be viewed as Markov decision processes (MDPs) with continuous state spaces. In this paper, we propose to apply the policy iteration approach in MDPs to the optimal control problem of stochastic systems. We first provide an optimality equation based on performance potentials and develop a policy iteration procedure. Then we apply policy iterat...
متن کاملAn Introduction to Hybrid Model Inventory Control with a Green Supplier Selection Model under Uncertainty
In the current decade, determining the most appropriate supplier as a strategic factor in the supply chain has attracted lots of consideration. On the other hand, organizations do necessary measures to implement green supply chain management in order to improve environmental and economic performance. An important way to implement green supply chain management, could be revising the method of pu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Artificial intelligence advances
سال: 2022
ISSN: ['2661-3220']
DOI: https://doi.org/10.30564/aia.v4i1.4361